EFFICIENT QUEUE-BASED CSMA WITH COLLISIONS 
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Recently there has been considerable interest in the design of effi- 
cient carrier sense multiple access (CSMA) protocol for wireless net- 
work starting works by [TD] [TT] [S] [T^] [Z] • The basic assumption under- 
lying these results is availability of perfect carrier sense information. 
This allows for design of continuous time algorithm under which col- 
lisions are avoided. 

The primary purpose of this note is to show how these results can 
be extended in the case when carrier sense information may not be 
perfect, or equivalently delayed. Specifically, an adaptation of algo- 
rithm in [111 112] is presented here for time slotted setup with carrier 
sense information available only at the end of the time slot. To es- 
tablish its throughput optimality, in additon to method developed 
in [111 112] , understanding properties of stationary distribution of a 
certain non-reversible Markov chain as well as bound on its mixing 
time is essential. This note presents these key results. 

A longer version of this note will provide detailed account of how 
this gets incorporated with methods of [111 112] to provide positive 
recurrence of underlying network Markov process. In addition, these 
results will help design optimal rate control in conjuction with CSMA 
in presence of collision building upon method of [7] . 

1. Setup. We consider a single-hop wireless network of n queues. Queues 
receive work as per exogeneous arrivals and work leaves the system upon 
receiving service. Time is slotted and indexed by r € {0, 1, . . . }. Arrival 
process is assumed to be discrete time and brings unit sized packets. Let 
Qi{r) £ N be number of packets waiting at the ith queue in the begining 
of time slot r. Let Ai(r) be the total number of packets arrived to queue i 
till the end of time slot r. For convenience, we shall assume that in a given 
time slot, arrivals happen at the end of the time slot. Also assume -Ai(-) is 
a Bernoulli i.i.d. process with rate Aj, i.e. Aj = Pr(^4j(r) — Aj(r — 1) = 1) 
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and Ai(r) - Ai(r - 1) G |0, 1} for all i, r > 1. Let Q(r) = [<5i(r)]i<i<„ and 
initially r = 0, Q(0) = (0 

The work from queues is served at the unit rate, but subject to inter- 
ference constraints. Specifically, let G = (V, E) denote the inference graph 
between the n queues, represented by vertices V = {1, . . . n} and edges -E: an 
G -E 1 implies that queues i and j can not transmit simultaneously since 
their transmission interfere with each other. Formally, let crj(r) G {0, 1} de- 
notes whether the queue z is transmitting at time r, i.e. work in queue i is 
being served at unit rate at time r and <r(r) = [(Tj(r)]. Then, it must be 
that for r G N, 

er(r) G X(G) = {p = [#] G {0, 1}" : Pi + p j < 1 for all G E}. 

We shall assume that if a non-empty queue i is served in time slot r, i.e. 
Qi{t) > 1 and <Tj(r) = 1 then a packet departs from it near the end of the 
time slot r, but before arrival happens. In summary, queueing dynamics: for 
any r > and 1 < i < n, 

Qi(r + 1) = Qi(r) - 0-i(T)I{g.( T )> O } + i4i(r). 

1.1. Scheduling constraints. The scheduling algorithm decides the sched- 
ule <r(r) G X(G) in the begining of each time slot, possibly using Q(r) and 
past history. This decision is made in a distributed manner by nodes. Specif- 
ically, in the beginning of each time slot, each node makes a decision to 
transmit or not. At the end of the time slot, node knows the following: 

o if it attempted to transmit, whether its attempt was successful; 
o if it did not attempt to transmit, whether any of its neighbor attempt- 
ing to transmit was successful. 

In summary, each node has delayed carrier sense information that is available 
at the end of the time slot. 

1.2. Capacity region. From the perspective of network performance, we 
would like the scheduling algorithm to be such that the queues in network 
remain as small as possible for the largest possible range of arrival rate 
vectors. To formalize this notion of performance, we define the capacity 
region. Let A be the capacity region defined as 

A = Conv(X(G)) 

(1) = | 1/ £ : 1/ < ^2 a °- <T ' wittl - °> and a<T - 1 f ' 

[ o-gX(G) o-eX(G) J 

1 Bold letters are reserved for vectors; 0, 1 represent vectors of all 0s & all Is respectively. 
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Definition 1 (throughput optimal) A scheduling algorithm is called 
throughput optimal, or stable, or providing 100% throughput, if for any 
A € A° the (appropriately defined) underlying network Markov process is 
positive (Harris) recurrent. 

2. Our algorithm. We present a randomized algorithm that is direct 
adaptation of the algorithm in [TTJ [12] for the discrete time setting. 

In the beginning of each time slot, say r, each node (or queue) does 
the following. With probability 1/2, independent of everything else, it does 
nothing. Otherwise, it executes the following: 

1. If o"j(r — 1) = 1, that is its transmission at time r — 1 was successful, 
then it decides to transmit with probability 1 — W ^ T ^ ■ 

2. If at time r — 1, any of its neighbor's transmission was successful, then 
does not attempt to transmit with probability 1. 

3. Otherwise, it attempts transmission with probability 1. 

Few remarks about the algorithm. In case 1, we choose 

(2) Wi(r) = exp (max {f(Q l (r)), V / /(Qmax(r))}) , 

where / : K+ — > [0, oo) is a strictly increasing function with /(0) = 0, 
lim^oo f(x) = oo and satisfies the property 

lim exp(/(x)) • /' (r 1 (5 f{x))) = 0, for any 5 G (0,1). 

X— >oo 

For example, any strictly increasing function with /(0) = and f{x) = 
o(logx) will have this property, e.g. f(x) = ^J\og(x + 1), loglog(x + e), etc. 

In above Q m ax( - ) = maxjQj(-), that is the maximum of all queue sizes. 
Of course, knowing this instantly is not possible. However, knowledge of 
Qmax( - ) ± 0(1) suffices and a simple scheme to achieve this is presented in 
|llj . Of course, authors strongly believe that explicit information exchange 
for knowing such an estimate is needed. 

Finally, it is assumed that if a node tries to attempt as part of the above 
algorithm, then it must send some data irrespective of the value of Qi(t). 

3. Properties of algorithm. To establish throughput optimality of 
the algorithm described above building upon method of PT2] will require 
us to understand property of the stationary distribution of a certain Markov 
chain of the space of independent sets 1(G) as well as its mixing time. We 
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study these two properties here. Relation of this Markov chain to algorithm 
of Section [2] is explained. 

As mentioned earlier, a longer version of this note will provide detailed 
proof of throughput optimality using these properties. 

3.1. A Markov chain & its mixing time. Consider a graph G = (V,E) 
of n = \V\ nodes with node weights W = [Wj] G M> 1 where R>i = {x G 
R : x > 1}. We consider Markov chain on the space of independent sets of 
G, 1(G) based on W with certain qualitative properties. In what follows 
we define what are feasible transitions as part of the chain and provide 
properties of the corresponding transition probabilities. This may not lead 
to an exact definition of the Markov chain, i.e. a class of Markov chains 
can satisfy these properties. However, as we shall show that all Markov 
chains with these properties have desired properties in terms of stationary 
distribution and their mixing times. 

Now we describe what sorts of transitions are allowed and properties 
of the corresponding transition probabilities. Suppose the Markov chain is 
currently in the state cr G Z(G). With abuse of notation, let cr denote the 
subset of V that {i G V : o~i = 1}. Then, under the Markov chain of interest, 
transition from cr to cr' is allowed if and only if cr' = aUS2\S\ where S\ C cr 
and S2 C V such that cr U S2 G 1(G). The probability of this transition, say 
Paa' is such that 



where 2" n < p(S 2 ) < 1. Let P = [P CTCT /] G [0, l]P(G)xX(G)| denote the transi- 
tion probability matrix. 

Under this Markov chain, there is strictly positive probability to reach 
empty set, 0, from any other state cr G 1(G) and vice versa; empty set has 
a self loop. Therefore, the Markov chain is irreducible, aperiodic. It is finite 
state and hence it has unique stationary distribution, say 7r. We claim the 
following two properties of the Markov chain P: first is about 7r and the 
second is about its mixing time. 

Lemma 1 For any W G R>i, 



Proof. To start with, it is clear that the stationary distribution 7r of the 
Markov chain P has 1(G) as its support. That is, 7r = [7Ter]ereX(G) with 
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Ho- > for all a G 1(G). Therefore, we can write 
(3) tto- oc exp(J7(cr)) , 

M + where K + = {i £ M : i > 0}. We will show that 



for some U : 1(G) 
for all a € 1(G) 

(4) 



O n2 n . 



Assuming @, we shall conclude the result of Lemma [TJ For this, we wish 
to utilize the following proposition that is a direct adaptation of the known 
results in literature (cf. [5] or see [12]). 



and let Ai(Q) be space of all distributions 



Proposition 2 Let T : $7 - 

on n. Define F : M(tt) -)■ M as 



F(n)=E tl (T(x)) + H ER (n), 

where Her(^) is the standard discrete entropy of /x. Then, F is uniquely 
maximized by the distribution u, where 

f x oc exp (T(x)) , for any x G Q. 

Further, with respect to v , we have 



E„[T(x)] > 



maxT(x) 



log |^| 



Now by applying Proposition [2] with v replaced by tv, 0, replaced 1(G) and 
T replaced by F, we have that 



K„ [F(a)] > 



(5) 



> 



max F(a) 
max F(a) 



log|X(G)| 



n, 



since \1(G)\ < 2 n . Using g]) and ©, it follows that 



(6) 



E, 



^ cri log 



> 



max y^pjlogWj 



O n2 n 



To complete the proof of Lemma [U we shall establish the remaining claim 
@. To this end, consider a different Markov chain on 1(G) with transition 
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probability matrix Q = [Qa-a- 1 ] such that Qo-o-' > if and only if Pcra' > 0. 
Now if Pa-a-' > 0, then it must be that there are S± C cr, S2 C V so that 
cr U S2 € 1(G) and in this case, we define Qa-a-' as 

^ q™> « h n w- 

Thus, we have that for all cr, a' 6 2^(G) with P aa i , Qa-a-' > 0, 

(8) 2~ n < < 1. 

It can be checked that Q, like P, is irreducible and aperiodic Markov chain 
on 1(G). Let 7? be the unique stationary of Q on 1(G). We claim that 



(9) tt^ oc 



J] Wi = exp f J^o-ilogWij . 

:<Ti=l \ i / 



To establish this, note that if transition from cr to cr' is feasible under Q 
(equivalently under P) then so is from cr' to a. Specifically, let cr = So U S± 
and cr' = S0US2, where So, Si, S2 are disjoint sets and SoUSiUS^ (= crUS2) 
is an independent set of G. Then, 



7T n 



n * >< iU 

n w i ) x 2 ~ n 



n w, U n ^ 



(10) = 7Tcr'Qe 



x 2" 



The (jlOh establishes that Q is reversible and satisfies detailed balance equa- 
tion with 7? as its stationary distribution. This establishes ([U]). 

Given Q, to establish @ as desired, it is sufficient to show that for any 
a e 1(G), 

(11) 2-" 2 " < ^ < 2™ 2 ". 

7T,t 
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To establish this, we shall use the characterization of stationary distributions 
for any irreducible, aperiodic finite state Markov chain given through what 
is known as the 'Markov chain tree theorem' (cf. see pQ). To this end, define 
a directed graph Q = (1(G), £) with 1(G) as vertices and directed edge 
(a, a') G £ if and only if Po-,cr> > (equivalently Q^^ 1 > 0). Let To- be the 
space of all directed spanning trees of Q rooted at a G 1(G). Define weight 
of a tree T G To- with respect to transition matrix P, denoted as w(T,P), 
as 

w(T,P)= Yl P Pt „. 
(p,p')eT 

Similarly, define weight of T G To- with respect to Q, denoted as w(T, Q), as 

w(T,Q)= J] Q Pyp/ . 

Then, the Markov Tree Theorem states that for any cr G 1(G), 

(12) w(T,P). 

TeTa- 

And, similarly for cr £l(G), 

(13) 7T CT CX J2 W ( T >Q)- 

TeTcr 

Since the number of edges in each spanning tree is no more than |X(G)| < 2 n , 
by ©, CE2D and CE]), it follows that for all a G 1(G) 

(14) 2~ n2 " < ^ <2 n2 \ 

This completes the proof of[TT]and subsequently that of Lemma [TJ □ 

Now we will obtain a mixing rate (or time) of the non-reversible Markov 
chain P. To this end, we present a bound of the matrix norm of P* since it 
crucially determines the mixing rate of P (cf. [9]). Here, P* is the adjoint 
matrix of P and the matrix norm IIP* II is defined as 



|P*|| = sup 



|P*v|| 97r 

I IIZ,7T 



v:EJvl=0 Il v ll2,7r 



where \\u\\ 2) n = JY^aei(G) ^(""o-) 2 for an Y u € R |I(G)I . 
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Lemma 3 Given P described above, let P* be its adjoint. Then, 

(1^) \\P*\\ < 1 i 

1 ' II II - 2 4 ™( 2n + 2 )+ 2 (W max ) 4 "' 

Proof. We shall use Cheeger's inequality to bound spectral gap for reversible 
Markov chain defined by PP* and then use it to bound \\P* || using standard 
result (cf. [9]). 

To that end, let A2 and Aix(C)| t> e the second-largest and smallest eigen- 
values of PP* , respectively! 2 ] It is known [9] that 

\\ p *\\ = y / max{|A 2 |, |A| X(G) ||} 

= max { Vl - (1 - |A 2 |), ^1-(1-|A| X(G) ||)} 

(16) <maxjl-i^M,l-i^^}. 

First observe that PP* > (component-wise) since P, P* > ^I. From 

this, it is easy to check that 

(17) 

X \i(G)\ > 2 x ^ - 1. 

Therefore, it suffices to obtain the bound of A2 for the desired bound of \\P* \\ 
in Lemma [3l In general, in the absence of such bound one can use 'lazy' 
version of the Markov chain, i.e. add self loop to all states with probability 
1/2, to make all eigenvalues non- negative and hence need to bound A2 only. 

Next, we will use the Cheeger's inequality [21 SI El El [T3] , it is well known 
that 

$2 

\ < 1 . 

2 

Here, $ is the conductance of R := PP* , defined as 

s, ■ Q(s,s c ) 

<I> = mm 



ScX(G) min{7r(S'),7r(5 c )}' 

where S c = 1(G)\S, Q{S,S C ) = E^eS^'eS* tt(<t)R(<t, <t'). Now we will 
consider the following naive bounds for ir and R to derive the desired bound 



2 PP* is reversible, hence all eigenvalues are real and in the interval [—1, 1]. 



RANDOMIZED NETWORK SCHEDULING 



9 



of and A2. 



(0) 1 



(6) 1 

> x 



(18) 



2-( 2 "+D(W max )«' 

where (a) and (b) follows from (|14p and ([9]), respectively. In addition, 

min R(cr,cr') > min P(cr,cr') x min P*(cr,cr') 

R(<t,<t')^0 P(<t,<t')^0 P*(<t,<t')^0 

> min P(a,a') x ( min 7TO- x min P(a,a') } 
P(er,cr')^0 V " 61 ^) P(<t,<t')^0 / 

> — x f 1 X — 

- 2™ X \2 n ( 2 "+ 1 )(W f max )' 1 X 2™ 

(19) = 2"(2"+3)(W max )"- 

Now by the standard application of these bounds (fLSj) and (fl9j) , we obtain 

$ > min Q(5, S c ) 

ScZ(G) 

> min iTo-RffT, cr ) 

P((t,<t')^0 

> min x min R(a,a') 
a&T(G) R(cr,cr')jtO 

(20) > 



2 2n(2"+2)( W - n 



\2n 



Therefore, from the Cheeger's inequality and (|2U|) 
(21) 

1 - A 2 > <5> 2 /2 > 



2 4n(2"+2)+l( W] 



An 

max y 



The desired bound of ||P*|| follows from (|16p . (|17p . f)21 1) and the property 
Wmax > 1 • This completes the proof of Lemma [3J □ 

3.2. Relation to Algorithm. Here is a quick explanation of why Markov 
chain P described in Section 13.11 arises naturally as part of the algorithm 
described in Section [2J To that end, the weight vector W = W(r) is time 
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varying and function of Q(t) as per ([2]). And, transition of the set of success- 
fully transmitting nodes <x(r — 1) at time slot r — 1 to the set of successfully 
transmitting nodes <x(r) at time r is as per the transition matrix P = -P(r), 
where P = P(t) has properties described above with weight W = W(t). 

To see this, consider the <t(t — 1). Then, a subset Si C cr(r — 1) can 
decide to stop transmitting at time r and these decisions are taken with 
probability proportional to Oigs 1 ! WIT)' Clearly, no nodes in neighborhood 
of <r(r — 1) will attempt transmission as per the algorithm. Therefore, new 
nodes attempting transmission must be such that they are not neighbors of 
any of the nodes in <x(r— 1). For any subset S2 such that <x(r— 1)US , 2 £l(G), 
it is possible to have <x(r) include S^- This is because, nodes in S2 attempt 
transmission and all of their neighbors (which by definition are not part 
of <t(t — 1)) do not attempt transmission - this happens with probability 
proportional to 2 _ ' 52 '~' r ( s ' 2 )' , where r(S < 2) are neighbors of S^- Indeed, for 
<t(t) to transit exactly to <x(r — 1) U S2\S±, the overall probability can be 
argued in a similar manner to be proportional to the following: 



2 n < p'(S2) < 1- This completes the explanation of relation between the 
algorithm and the Markov chain described in Section 13.11 
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